The shift-invariant discrete wavelet transform and application to speech waveform analysis.
نویسندگان
چکیده
The discrete wavelet transform may be used as a signal-processing tool for visualization and analysis of nonstationary, time-sampled waveforms. The highly desirable property of shift invariance can be obtained at the cost of a moderate increase in computational complexity, and accepting a least-squares inverse (pseudoinverse) in place of a true inverse. A new algorithm for the pseudoinverse of the shift-invariant transform that is easier to implement in array-oriented scripting languages than existing algorithms is presented together with self-contained proofs. Representing only one of the many and varied potential applications, a recorded speech waveform illustrates the benefits of shift invariance with pseudoinvertibility. Visualization shows the glottal modulation of vowel formants and frication noise, revealing secondary glottal pulses and other waveform irregularities. Additionally, performing sound waveform editing operations (i.e., cutting and pasting sections) on the shift-invariant wavelet representation automatically produces quiet, click-free section boundaries in the resulting sound. The capabilities of this wavelet-domain editing technique are demonstrated by changing the rate of a recorded spoken word. Individual pitch periods are repeated to obtain a half-speed result, and alternate individual pitch periods are removed to obtain a double-speed result. The original pitch and formant frequencies are preserved. In informal listening tests, the results are clear and understandable.
منابع مشابه
Efficient Algorithms for Invariant Discrete Wavelet Decomposition
Classical discrete wavelet packet transforms are sensitive to changes in image orientation and translation. Therefore, it is hardly possible to extract rotation invariant features from images in the transform domain. This paper proposes several algorithms for invariant discrete wavelet decomposition to produce an invariant representation for an image. The procedure can be divided into several s...
متن کامل【the Invention】 [discrete Wavelet Transform Based Multiple Template-matching for Speech Recognition]
【Abstract】 This invention is method of speech recognition that consists of plural overlapped templatematching (TM). In order to economize calculations of TM, the data of low resolution given by Haar discrete wavelet transform (HDWT) is used. Templates of wavelet coefficient (WC) on waveform are used for recognition of phoneme. A sum of WC in a scale (SWC) corresponds to a frequency component on...
متن کاملAN INTELLIGENT FAULT DIAGNOSIS APPROACH FOR GEARS AND BEARINGS BASED ON WAVELET TRANSFORM AS A PREPROCESSOR AND ARTIFICIAL NEURAL NETWORKS
In this paper, a fault diagnosis system based on discrete wavelet transform (DWT) and artificial neural networks (ANNs) is designed to diagnose different types of fault in gears and bearings. DWT is an advanced signal-processing technique for fault detection and identification. Five features of wavelet transform RMS, crest factor, kurtosis, standard deviation and skewness of discrete wavelet co...
متن کاملPixel - Level Fusion of Image Sequences using Wavelet Frames
In this paper we propose a novel approach to the pixel level fusion of spatially registered image sequences. This fusion method incorporates a shift invariant extension of the discrete Wavelet Transform, based on the concept of Wavelet Frames which yields an overcomplete signal representation. The advantage of the proposed fusion method is the improved temporal stability and consistency of the ...
متن کاملNoise Reduction Using an Undecimated Discrete
A new nonlinear noise reduction method is presented that uses the discrete wavelet transform. Similar to Donoho and Johnstone, we employ thresholding in the wavelet transform domain but, following a suggestion by Coifman, we use an undecimated, shift-invariant, nonorthogonal wavelet transform instead of the usual orthogonal one. This new approach can be interpreted as a repeated application of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- The Journal of the Acoustical Society of America
دوره 117 4 Pt 1 شماره
صفحات -
تاریخ انتشار 2005